i ayior, D.W. 2014. Large inequalities in herbarium specimen density in the western United States. Phytoneuron 2014- 
53: 1-8. Published 2 June 2014. ISSN 2153 733X 

LARGE INEQUALITIES IN HERBARIUM SPECIMEN DENSITY 

IN THE WESTERN UNITED STATES 

Dean Wm. Taylor 

Redwood Drive 
Aptos, California 95003-2517 
deanwm taylor@gma:il. com 

ABSTRACT 

Herbarium specimen records from 11 western states of the IJSA are assayed (5.6 million 
specimen records obtained from on-line databases). On a county basis, specimen density varies 
considerably, by a factor of —2500 (highest in San Francisco County, California, at 99.2 
specimens /knf; lowest in Blaine, Liberty, and Prairie counties, Montana, at 0.04 specimens/km"). 
California exhibits the highest specimen density, nearly 2x higher than the second highest state 
(Wyoming). All states except Nevada have one to several counties which fall above the 80 U1 
percentile for density. Many states have large areas below the 20 th percentile. Specimen density is 
correlated with human population density. Large areas deficient in specimen records indicate 
challenges for vascular plant systematics and conservation. A comprehensive, adequately funded 
program of floristic exploration in the western USA would require gathering of an additional -1.5-2 
million specimens to minimally rectify the observed inequalities. 


Herbarium specimens are the fundamental currency of biodiversity studies. In the intellectual 
economy of biological systematics, classifications are products, specimens their raw materials. In the 
USA, herbaria have made great progress in digitization of specimen records in their holdings, 
supported by modest funding. Production of regional floras such as the revised Jepson Manual 
(Baldwin et al. 2012) and Flora North .America North of Mexico (FNA 1993-2006) have been greatly 
facilitated by digitization of herbarium records. However, the increased access to digitized specimen 
records is in contrast to the overall decline in specimen collecting (Prather et al. 2004a,b; Tewksbury 
etal. 2014). 

This paper identifies a significant inequality of herbarium specimen density in the western 
USA, based on accession of digitized records. 

Methods 

I queried on-line specimen databases for herbarium records of vascular plants by county for 
the 414 counties in 11 western USA states between 10 January and April 4, 2014. Records from 
regional consortia were tallied first (Consortium of C alifornia Herbaria, Consortium of Pacific 
Northwest Herbaria, Southwest Environmental Information Network), then records from larger 
individual herbaria were added (CAS, RM, COLO, MINN and MO-Tropicos — herbarium 
abbreviations follow r Thiers 2014). Queries were structured to eliminate specimen records offered by 
more than one data provider across platforms. Data on spelling of county names, state and county 
land surface areas, and human population density (2010 census) were obtained ifom Wikipedia 
(accessed January 2014). Several variant county spellings were corrected in the queries. Minor 
internal inconsistencies in the area of counties and states were not corrected (these in no case are 
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greater than about 20 km ). Pres entation and analysis of the data is given as percentiles for simplicity 
of comparisons. 
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Results and discussion 
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Observed specimen density (specimens/km") diners by a factor of 2480, ranging between 

99.2 (San Francisco County, California) to 0.04 (Blaine, Liberty, and Prairie counties, Montana). 

Using the breakdown criteria of Tukey (1977), 33 counties are outliers relative to the other 381 

counties: all outliers are enriched in specimen density. The mean specimen density of these 33 
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enriched outliers is 12.27 specimen s/km". The mean for the 381 counties that are not outliers is 1.65 
specimens/km" , smaller by a factor of 7. Figure 1 provides a frequency distribution of the density for 
the 414 counties: the distribution is highly left-skewed. 

The inequalities in specimen density have a strong geographic pattern (Figure 2, Table 1). 
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Relative to other states, California is well-collected, with an average density of 4.76 specimens/knV 

(1.9 million digitized specimen records). Allowing for the fact that California herbaria are not yet 
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fully digitized (CAS and DAY remain to finish), the actual density might approach 6 specimens/km 
(based on reported Consortium of California Herbaria digitization progress reports). By contrast, 
Montana and the Great Bas in states of Nevada and Utah are comparably depauperate (Table 1). 

All states have both wdi 1-collected (defined as >80* percentile) and under-collected <defined 
as <50 t ! percentile) counties (Figure 2). Some of the deficient counties are remote from population 
centers (Utah, Carbon, and Emery counties; Nevada, all but Carson City; Crook County, Oregon, etc., 
cf. Figure 2 ). Other deficient regions include counties with a large fraction of the land area devoted 
to agriculture (e.g. Jerome and Minidoka counties, Idaho; counties in eastern Colorado). Other 
deficient counties, although agricultural, include sizeable remnants of non-managed vegetation (most 
of the deficient counties in Montana; Jefferson County in Idaho; Adams, Franklin, and Lincoln 
counties in Washington). The frequent low specimen density in rural counties also suggest that weed 
records might be disprop ortionately lacking therein. 

Regression analysis identified a significant correlation between specimen density and density 
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of human population based on the 2010 census (R 0.66, p<0.001) — counties with a greater human 

population are well-collected compared to sparsely populated counties. Moreover, the geographic 
location of enriched vs. deficient counties suggests that proximity to herbaria and universities, which 
are located in population centers, is important. Such a pattern is not that unexpected: botanists gather 
specimens close to home. 

The western USA states fall perhaps into three groups (Table 1). California, Colorado, and 
Wyoming have high to moderate specimen density, Washington and Arizona intermediate density, 
with the remaining states trailing behind (Nevada far in the rear). 

Table 1 summarizes two potential criteria upon which the adequacy of regional specimen 
density could be assessed. If states are judged relative to a performance standard based on the mean 
specimen density for California, nearly 9 million additional specimen records would be necessary to 
attain this data density elsewhere over the region. Alternatively, if states are judged against the 70 th 

■ -r- m p ■ ■ ■ O " " " 

percentile for non-California counties (about 2 specimens/km"), the resultant specimen deficit is a 
mere 1.8 million specimens ('fable 1). By this latter standard California, Wyoming, and Colorado are 
well-collected, with Washington, Arizona, Idaho, Oregon, New Mexico, Utah. Montana, and Nevada 
deficient to varying degrees. 

For convenience, readers who know the specimen number or density for a particular county can 
approximate percentile using the regression equation: percentile = 0.22461n (density) + 0.4251 
(#=.939) 
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Specimen Density 
(sheets/km2) 



Deficit Comparison 

STATE 

Mean 

High 

Low 

Specimens 

Area 

(km 2 ) 

Calif. 

Sta nd a rd 

70% tile 

Non-CA 

Standard 

California 

4.76 

99.19 

0.35 

1,922,851 

404,224 

0 

+1,118,445 

Wyoming 

2.75 

5.82 

0.83 

695,570 

253,348 

-510,366 

+191,407 

Colorado 

230 

14.74 

0.06 

620,782 

269,837 

-663,642 

+83,806 

Washington 

1.92 

26.84 

0.03 

355,227 

184,827 

-524,550 

-12,579 

Arizona 

1.84 

6.21 

0.67 

542,300 

295,234 

-863,014 

-45,216 

Idaho 

1.43 

7.58 

0.08 

310,419 

216,632 

-720,749 

-120,679 

Oregon 

1.34 

10.73 

0.36 

340,518 

255,026 

-873,406 

-166,984 

New Mexico 

1.01 

10.98 

0.05 

317,580 

315,194 

-1,182,743 

-309,656 

Utah 

0.82 

3.76 

0.16 

179,546 

219,887 

-867,116 

-258,029 

Montana 

0.81 

4.86 

0.04 

310,181 

380,800 

-1,502,427 

-447,611 

Nevada 

0.34 

2.25 

0.13 

96,696 

286,367 

-1,266,411 

-473,174 

Grand Mean 1.77 



5,691,670 

3,081,376 

-8,974,425 

-1,833,927 


Table 1. Number of databased specimens by state, specimen density, and deficit comparisons based on the 
specimen density for California, and for the 70 -percentile county density for non-California counties. 
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Figure 1. Frequency distribution of specimen density for 414 counties in 11 western states of the USA. The 
distribution is strongly left-skewed (the mean density is 1.77, Table 1). 


Arizona — Only one county, the smallest in the state, Santa Cruz County, is well-collected, with 
adjacent Pima and Cochise counties intermediate. Notable in this region are the high mountains with 
Madrean floristic affinities (McLaughlin 1994), which are regarded as a biodiversity hot-spot 
(Mittermeier et al. 2011) and which are under active study (see Felger et al. 2014). Two of the 
deficient counties ( Yuma and La Paz) are in the western Sonoran desert region: adjacent Imperial 
County, California, and Clark County, Nevada, have similarly low density, making a regional 
deficiency. Navajo County is likewise deficient — its far northern portion is included in the recent 
Four Corners Flora (Heil et al. 2013). 
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Percentile (density) 

■ >80 (3.5) 

□ 79-65(2.4-3.4) 
64-51(2.3-1.4) 

□ 50-36(1.3-0.9) 

□ 35-20(0.8-0.4) 

□ <20(<0.39) 


Figure 2. County map of the western United States showing herbarium specimen density (6 mapped categories 
based on percentile of observed county density variation). 


California - In California, which is generally well-collected. Kings County stands out as one of the 

2 ii 

least collected counties in the western USA (0.32 spedmens/knf, 15 percentile). It would be 
attractive to attribute the lack of specimens from Kings County to the fact that a large portion of the 
county lies in the San Joaquin Valley, the most intensively agrarian landscape in North America 
(Preston 1981). However, the western third of the county is mountainous, situated in the inner South 
Coast Ranges, suggesting the lack of specimens is not due to a lack of species richness in the flora. 
The recently described monotypic Brassicaceae genus Twisselmarmia (Al-Shebaz 1999), a narrow 
endemic to Kings County, suggests that the low collection density for that county may owe, at least in 
part, to under-collection 
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The map in Figure 2 shows that California contains a disproportionate number of well- 
collected counties. Moreover, non-political geographic subdivisions of the state are also high density 
outliers: Yosemite National Park (33,214 specimens) has a density of 11 specimens/knY, equal to 97 t!1 
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percentile for counties. Yosemite Valley (±20 km", 5843 specimens) has an extreme density, about 
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275 specimens/km''. The California Channel Islands are also all extreme high density outliers: 
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Anacapa Island (911 specimens/km"), Santa Barbara Island (488), Santa Cruz Island (87), Santa 
Catalina Island (60), San Nicolas Island, and Santa Rosa Island (28 specimens/kut) all exceed the 
99 th percentile equivalent for county density. 

Colorado — third to California in collection density (Table 1), the strong pattern of well-collected 
counties in the mountains along the Front Range and in the headwaters Colorado River basin is in 
direct contrast to the poorly collected counties in the eastern Great Plains. Elev en counties exceed the 
9G M percentile: Gilpin, Boulder, Clear Creek, San Juan, Grand, Denver, Lake, Larimer, Gunnison, 
Jackson, and Jefferson. Although the recently published Four Corners Flora (Heil et al. 2013) covers 
a geographic region where some of the counties are well-collected, overall the collection density of 
that flora region averages the 58 th percentile. The eastern plains portion of the state average in the 7 th 
percentile (in this region, Otero County is better collected, 51 s1 percentile). These eastern counties are 
largely agricultural and sparsely populated: low specimen density is opposed to the prevalence of 
grazing lands used for intensive livestock grazing, suggesting poor floristic documentation. The same 
west to east gradient in specimen density is also seen in adjacent New Mexico. 

Idaho — overall collecting density is more uniform than adjacent states, but most counties fall in the 
lower range (<50 th percentile). Overall, the density pattern is similar to Oregon, but there are still 
several marked inequalities. The low collection density in the western portion of the Snake River 
plains (Lincoln, Jefferson, Minidoka, and Jerome counties) indicates a significant regional deficit, 
perhaps attributable to an intensive agricultural district. The high concentration in Latah County (85 fl ‘ 
percentile) is part of a regional pattern connected to adjacent Washington (associated with proximity 
to WTO and nearby ID/IDF). 

New Mexico — almost all of the state falls below the 35 d ' percentile, with the far eastern portion below 
the 20 i; percentile (the easternmost deficient counties mirror the same pattern in Colorado). The sole 
well or moderately collected counties are in the north in the Rocky Mountains. 

Nevada — relatively a ‘third-world 5 state, the least collected state of any in the western USA. The 
recently finished Intermountain Flora (Holmgren et al. 2012) essentially covers most all the white 
space of the map in Figure 2. It should be noted that digitized specimen records for the Intermountain 
Flora region concentrate on the last two published volumes (Holmgren et al. 2005, 2012) and as such, 
on-line specimen density for the region is artificially low, although it is unlikely to be of notable 
greater overall magnitude. The only counties with even a modicum of specimen density are those in 
the Sierra Nevada and its eastern foothills adjacent to California (Carson City, Douglas, and Storey 
counties, in the Reno region). 

Montana — herbarium specimen density for most of the state is dismally low, relatively. Half of the 
counties in the state are below the 20 th percentile, and nearly 90% of the state is below the 50 t!1 
percentile. Only three counties (Carbon, Gallatin, and Park) have high density (>80 l ! percentile), 
these in the Rocky Mountains (proximal to MONT and more significantly RAT) and adjacent to much 
better collected Wyoming. The large deficient counties are sparsely populated but are not entirely 
given over to intensive agriculture; they retain sizable remnant habitat areas of non-managed 
vegetation. 
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Oregon — specimen density is most distinctly divided regionally west and east of the Cascades, with 
the highest density in urban Multnomah and Benton counties (vicinity of ORE/OSU). The far 
southwestern counties (Josephine, Curry, and Jackson) are well-collected, being part of the California 
Floristic Province. The moderate density in the Wallowa Mountains of the far northeast is also of 
note. 

Utah - significantly under-collected relative to the overall mean density (Table 1). The only two 
counties with high density (Daggett and Cache) are in the Wasatch Mountain province. Washington 
County in the far southwest is moderately well-collected, at 74 tn percentile. Most of the Bonneville 
Basin in the western half of the state is very significantly under-collected (<20 ih percentile). The state 
flora, now in its 4 trt revision (Welsh et al. 2008), indicates slow but steady progress in floristic 
documentation, but die low overall collection density suggests opportunities for fruitful future 
floristic exploration. 

Wyoming — the second best collected state in the data set. The 6 counties that comprise the Rocky 
Mountain portion of the state - T eton, Albany, Park, Sublette, Lincoln, and Crook, form a large well- 
collected block exceeding the 8G tf ‘ percentile. By comparison to other western states, Wyoming has 
comparably few poorly collected counties: Niobrara County is the lowest at the 33 ld percentile. 
Compared to ecologically similar Montana, die differences in collection density amongst comparable 
physiographic regions in the two states are manifold. 

Conclusions 

Large areas of the North American continent extremely deficient in herbarium specimen 
records indicate significant challenges for vascular plant systematics and conservation in the USA 
Bebber et. al (2010, 2012) have quantitatively demonstrated that description of new species of 
vascular plants is disconnected from their initial discovery. That is, new species are most often 
"discovered” in herbaria — specimens of undescribed taxa reside in collections for several decades 
before they are recognized and treated and their initial recognition requires a threshold number of 
accumulated specimens. The nearly 2500-fold difference in collection density reported in this paper 
strongly implies that undiscovered plant diversity in under-collected regions is concealed. Given this, 
going forward, preparation of floristic treatments will suffer from the unequal distribution of 
specimens. 

Bebber et al. (2012) also have shown that contribution of type specimens is non-randomly 
distributed among collectors. Their analysis of the dynamics of plant collecting shows that long 
apprentice times are required to develop expertise in floristic botany. Accordingly, the apparent 
decline in practicing taxonomists (Whitfield 2012; Bacher 2012) is in stark contrast to the large large, 
under-collected areas of Figure 2. Who will collect the requisite 2 million specimens to fill in the 
map? 

The fact that California is disproportionately collected relative to the 10 other western USA 
states — arguably ‘over-collected 7 — would seem to indicate that no further specimens need be 
gathered there. That such a view would be folly has been convincingly demonstrated by Joppa et al. 
(2011), who show that biodiversity ‘hot-spots’ house most undescribed plant species. The California 
region is such a ‘ hot-spot’ of vascular plant diversity. In the roughly two decade interval between the 
1 st and 2 nd editions of the Jepson Manual (Hickman 1993; Baldwin et al. 2012), about 163 new 
California endemics were described, and about 373,000 additional accessions were made after initial 
publication of a revised, modern flora (specimen data estimate based on Consortium of California 
Herbaria). Even well-collected California needs attention from floristic botanists. 

Common sense indicates that it is unlikely that botanists can muster the effort, given available 
resources, to gather 9 million additional specimens from the western USA I believe, however, that it 
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is not unrealistic to gather and/or database an additional 1.8 million regional specimens (to equal the 
70 -percentile standard, see Table 1). Attaining such a goal would require significant increase in 
floristic field studies, with increased and adequate funding. A comprehensive, adequately funded 
program of floristic exploration in the western USA is necessary to rectify the observed inequalities. 
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